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Abstract 


We  propose  a  cutting  plane  algorithm  for  mixed  0-1  programs  based  on  a  family  of 
polyhedra  which  strengthen  the  usual  LP  relaxation.  We  show  how  to  generate  a  facet  of 
a  polyhedron  in  this  family  which  is  most  violated  by  the  current  fractional  point.  This 
cut  is  found  through  the  solution  of  a  linear  program  that  has  about  twice  the  size  of 
the  usual  LP  relaxation.  A  lifting  step  is  used  to  reduce  the  size  of  the  LP’s  needed  to 
generate  the  cuts.  An  additional  strengthening  step  suggested  by  Balas  and  Jeroslow  is 
then  applied.  We  report  our  computational  experience  with  a  preliminary  version  of  the 
algorithm.  This  approach  is  related  to  the  work  of  Balas  on  disjunctive  programming,  the 
matrix  cut  relaxations  of  Lovasz  and  Schrijver  and  the  hierarchy  of  relaxations  of  Sherali 
and  Adams. 

Key  Words:  Cutting  planes,  projection,  mixed  0-1  programming,  disjunctive  program¬ 
ming. 
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1  Introduction 


More  than  thirty  years  have  elapsed  since  the  emergence  of  cutting  plane  algorithms  for 
mixed  integer  programming,  but  branch  and  bound  is  still  the  prevalent  way  to  handle  such 
problems.  In  the  last  10-15  years  there  has  been  considerable  progress  in  using  combinatorial 
cutting  planes  for  certain  classes  of  pure  int^er  programming  problems,  such  as  the  symmetric 
traveling  salesman  problem,  in  combination  with  branch  and  bound  (see,  for  instance  [PR87]). 
The  success  of  this  approach,  known  as  branch  and  cut,  can  be  largely  attributed  to  the 
fact  that  the  combinatorial  cutting  planes  used  are  often  facets  of  the  underlying  integer 
polyhedron.  For  a  mixed  integer  program,  or  for  that  matter  a  general  pure  integer  program, 
facets  for  the  integer  polyhedron  are  not  easy  to  obtain.  For  pure  integer  programs,  one 
way  to  generate  deep  cuts  is  to  use  facets  of  the  knapsack  problems  obtained  by  considering 
each  constraint  separately.  This  approach  was  applied  successfully  by  Crowder,  Johnson  and 
Padberg  [CJP83]  to  pure  0-1  programs  without  special  structure.  A  similar  idea  was  used  by 
Van  Roy  and  Wolsey  [VW87]  for  mixed  0-1  programs. 

Another  way  of  strengthening  the  linear  programming  relaxation  of  an  int^er  program  is 
to  lift  the  problem  into  a  higher  dimensional  space,  where  a  more  convenient  formulation  may 
give  a  tighter  relaxation.  One  then  has  a  choice  between  working  with  this  tighter  relaxation 
in  the  higher  dimensional  space,  or  projecting  it  back  onto  the  original  space.  In  this  latter 
case,  the  whole  procedure  can  be  viewed  as  a  method  for  generating  cutting  planes  in  the 
original  space. 

One  such  procedure  was  recently  proposed  by  Lovasz  and  Schrijver  [LS89]  for  0-1  pro¬ 
grams.  The  higher  dimensional  space  they  use  is  obtained  by  multiplying  every  inequality 
by  every  0-1  variable  and  its  complement  in  turn,  then  linearizing  the  resulting  system  of 
quadratic  inequalities  and  finally  projecting  back  the  system  onto  the  original  space.  The 
lifting  phase  of  this  procedure  involves  a  squaring  of  the  number  of  variables  and  an  even 
steeper  increase  in  the  number  of  constraints,  but  iterating  the  lifting/projecting  step  a  num¬ 
ber  of  times  equal  to  the  number  of  original  0-1  variables  yields  the  convex  hull  of  feasible 
0-1  points. 

A  similar  Ufting/projecting  procedure,  which  obtains  the  integer  hull  in  a  non-iterative 
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fashion  through  simultaneous  multiplication  of  the  ori^nal  constraint  set  by  all  the  0-1  vari¬ 
ables  and  thmr  complements  followed  by  projection,  had  been  proposed  by  Sherali  and  Adams 
[SA88,  SA89]. 

In  this  paper  we  propose  a  lifting/projecting  procedure  where  the  original  constraint  set  is 
multiplied  by  a  single  0-1  variable  and  its  complement  before  projecting  back  onto  the  original 
space.  The  lifting  phase  of  our  procedure  involves  only  a  doubling  rather  than  a  squaring 
of  the  number  of  variables  and  constraints,  nevertheless  iterating  the  lifting/projecting  step 
as  many  times  as  the  original  number  of  0-1  variables  yields  the  convex  hull  of  feasible  0-1 
points,  as  in  the  Lovass-Schrijver  approach. 

We  then  show  that  our  iterated  procedure  is  equivalent  to  the  sequential  convexiiication 
procedure  for  facial  disjunctive  programs  (of  which  mixed  0-1  programs  are  a  special  case), 
introduced  by  Balas  [B74b,  B79]  in  the  seventies.  The  new  insight,  which  comes  from  re¬ 
discovering  a  previously  known  structure  from  an  entirely  different  perspective,  leads  us  to 
examine  a  class  of  finitely  convergent  cutting  plane  algorithms  for  mixed  0-1  programs  based 
on  the  iterative  lifting/projecting  procedure  outlined  above.  The  cutting  planes  generated  by 
the  procedure  are  facets  of  the  current  projected  polyhedron,  and  their  derivation  involves  the 
solution  of  a  linear  program  of  roughly  twice  the  size  of  the  original  problem.  The  objective 
function  of  this  linear  program  is  aimed  at  choosing  among  the  members  of  the  given  family 
of  cuts  a  deepest  one,  i.e,  one  that  cuts  off  the  optimal  vertex  of  the  current  relaxation  by 
more  than  any  other  member  of  the  family. 

The  paper  is  organized  as  follows.  Section  2  introduces  the  theory  behind  our  approach. 
Section  2.1  states  our  lifting/projecting  procedure  and  gives  its  main  properties.  Section  2.2 
compares  this  procedure  with  the  Lovasz  and  Schrijver  construction.  Section  2.3  sketches 
the  Sherali- Adams  results  and  relates  them  to  ours.  Section  2.4  shows  the  equivalence  of  our 
lifting/projecting  procedure  to  the  sequential  convexification  procedure  for  facial  disjunctive 
sets.  Finally,  Section  2.5  applies  our  procedure  to  the  stable  set  polytope  to  recover  some  of 
the  well-known  facet  inducing  inequalities. 

Section  3  discusses  a  class  of  cutting  plane  algorithms  based  on  the  material  of  Section 
2.  Section  3.1  outlines  the  approach  and  discusses  some  of  the  issues  and  options  that  arise. 
Section  3.2  gives  a  finiteness  proof  for  a  specialized  version  of  the  algorithm.  Section  3.3 
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shows  how  some  cutting  planes  can  be  generated  from  the  simplex  tableau,  and  Section  3.4 
discusses  a  lifting  step  used  to  reduce  the  size  of  the  LP’s  needed  to  generate  the  cuts.  Section 
3.5  applies  to  our  inequalities  the  strengthening  procedure  introduced  by  Balas  and  Jeroslow 
for  disjunctive  cuts. 

Finally,  Section  4  describes  our  preliminary  computational  experience  with  some  versions 
of  the  algorithm  discussed  under  Section  3.1.  The  preliminary  computational  experiments 
that  we  carried  out  indicate  that,  for  some  classes  of  problems,  a  relatively  low  number  of 
iterations  is  needed  to  find  the  optimum  or  get  close  to  it. 

2  Projection  and  Convexification 

Define 

K  :=  {z  €  li"  :  j4x  >  6,1  >  0,Xj  <  l,j  =  1, . .  .,p} 

:=  {x€  6) 

and 

;=  {i  €  if  :xj  €  {0,l),j=  l,...,p}. 

is  a  mixed  integer  set  with  n  variables,  p  of  which  are  0,  1  constrained.  K  is  the 
standard  linear  relaxation.  In  this  section  we  consider  procedures  that  yield  conv  K°  starting 
from  K. 

2.1  A  sequential  convexification  procedure 
0.  Select  an  index  j  €  {1, . . . ,  p}. 

1.  Multiply  Ax  >b  toith  1  -  Xj  and  Xj  to  obtain  the  nonlinear  system 

{l-Xj)(Ax-b)  >  0 

Xj{Ax  ~b)  >  0  (1) 

2.  Linearize  (1)  by  substituting  y,-  for  XiXj,i  =  l,...,n,i  j/i  j,  and  xj  for  xj.  Call  the 
polyhedron  defined  by  the  resulting  system  Mj{K). 
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S.  Project  Mj{K)  onto  the  x-space  by  eliminating  yi,i~  l,...,n,i^  j.  Call  the  resiUting 
polyhedron  Pj{K). 

Note  that,  if  the  system  defining  K  has  m  constraints  and  n  variables,  the  system  defining 
Mj(K)  has  2m  constraints  and  2n  —  1  variables.  It  is  clear  that  K°  C  Pj(K)  and  that 
Pj(K)  C  K.Jn  fact,  we  have: 

Theorem  2.1  PjiK)  =  conv(K  n  {*  €  ti”  :  xj  €  {0, 1}}). 

Since  any  x  that  satisfies  Ax>h  and  0  <  z ^  <  1  clearly  satisfies  both  (1  - Xj)(Ax  -6)  >  0 

A*  ^ 

and  Xj{Ax  —  A)  >  0,  the  multiplications  performed  in  Step  1  above  are  not  responsible  for 
tightening  the  constraints  of  K.  Farther,  replacing  XiXj  with  y,-  for  all  i  ^  j  in  Step  2  above 
cannot  tighten  those  constraints  either.  Yet,  unless  the  0-1  constraint  is  redundant  for  variable 
j,  the  projection  Pj{K)  of  the  set  Mj(K)  resulting  from  Step  2  is  strictly  contained  in  K. 
The  only  operation  that  is  "accountable”  for  this  tightening  is  the  replacement  of  the  terms 
x^  by  xj.  Indeed,  while  this  substitution  does  not  eliminate  any  points  for  which  Xj  e  {0, 1), 
it  does  cut  off  points  x  with  0  <  z j  <  1. 

For  t  >  2,  define  P, . i,(K)  =  Pi,{Pi,., .  ..{Pi,{K))  • .  •)• 

Theorem  2.2  For  any  f  =  1, . .  .,p, 

=  conv{K  n  {z  €  »"  :  Xj  €  {0,  l),i  =  1, . 

Corollary  2.3  P\,...,p{K)  ~  convK°, 

Proof  of  Theorem  2.1. 

ii)Pi{K)  C  conv(Jir  n  {z  :  Zj  €  {0,1}}). 

First  assume  iif  n  {z  :  Zj  =  0}  =  0,  Then  Zj  —  c  >  0  is  valid  for  K  for  some  c  >  0.  This 
implies  that  (1  -  Zj)(zj  -  e)  >  0  is  satisfied  by  any  z  that  satisfies  (1).  Repladng  z^  by  Xj, 
it  follows  that  Zj  >  1  is  valid  for  Mj{K)  and  for  Pj{K).  This,  together  with  PjiK)  C  K, 
implies  PjiK)  C  K  r\  {x  :  Xj  =  1}  and  hence  (t). 
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Similarly,  'd  K  r\{x  :  Xj  =  1}  =  0,  xj  <  0  is  valid  for  PjiK),  and  again  (t)  follows. 

Assume  now  that  K  r\{x  :  Xj  =  0}  0  and  K  Ci  {x  :  Xj  =  1}  ^  0,  and  let  ai  >  /?  be  a 

valid  inequality  for  conv(if  D  {x  :xj  e  {0,1}}).  Since  ax  >  0  is  valid  for  A"  n  {x  :  Xj  =  0}, 
there  exists  A  >  0  such  that  ax  +  Xxj  >  /9  is  valid  for  K.  Furthermore,  since  ax  >  ^  is  valid 
for  A*  n  {x  :  Xj  =  1}  there  exists  some  ft  >  0  such  that  ax  +  ji(l  —  Xj)  >  ^  is  valid  for 
K.  Now  since  ax  +  Axj  —  /3  >  0  and  ax  +  /i(l  —  Xj)  >  ^  are  valid  for  A,  the  inequalities 
(1  -  Xj)(ax  +  Axj  -  /9)  >  0  and  Xj(ai  +  /i(l  —  xj)  -  ^)  >  0  are  satiiied  by  any  x  that  satisfies 
(1).  Adding  these  two  inequalities  yields  ax  +  (A  +  /i)(xj  —  x})  —  j9  >  0  and,  after  setting 
Xj  =  Xj  ,  ax  -  >  0  .  Hence  ax  -  >  0  is  valid  for  Mj(A)  and  for  Pj(K). 

(it)  conv(A  n  {x  :  Xj  €  {0,1}})  C  Pj(K). 

To  prove  this,  let  ax  >  be  a  valid  linear  inequality  for  Pj(K).  Then  ax  >  is  dominated 
by  some  inequality  of  the  form 

u(Ax  -  6)(1  —  Xj)  + t;(Ax  -  6)xj  >  0  (2) 

where  «,  v  >  0  are  row  vectors  such  that  all  terms  XjXj,  i  ^  j,  cancel  out,  and  where  Xj  is  to 
be  substituted  everywhere  for  x^.  But  any  such  inequality  becomes  u{Ax  -  6)  >  0  if  Xj  =  0, 
and  v(Ax  —  b)  >  0  if  Xj  =  1.  Thus  (2)  is  valid  for  A  n  {x  :  Xj  :=  0}  and  for  A  n  {x  :  Xj  =  1}; 
i.e.,  (2)  and  hence  ax  ^  ^  is  valid  for  conv(A  D  {x:xj  e  {0,1}}).  ■ 

Proof  of  Theorem  2.2.  For  f  €  {1, .  ••,?},  let  Ft  :=  {x  :  Xj  €  {0, 1}  for  j  =  !,...,<}.  We 
use  induction  on  t.  For  t  =  1,  the  result  follows  from  Theorem  2.1.  Suppose  the  result  holds 
for  t  =  1, . . . ,  g  -  1  and  let  t  =  g,  2  <  g  <  p.  Then 

Pi . ,(A)  =  P,(conv(A'  n  A,-i))  =  conv(conv(A  n  A,_i)  n  {x  :  x,  €  {0,1}}) 

where  the  first  equation  is  implied  by  the  induction  hypothesis,  while  the  second  follows  from 
Theorem  2.1.  The  last  expression  can  be  rewritten  as 

conv((conv(A  n  A,.! )  n  {x  :  x,  =  0})  U  (conv(A  n  A,_i)  D  {x  :  x,  =  1}))  (3) 

Now  let  5  C  R"  and  let  A  :=  {x  €  Ji"  :  ax  =  13]  be  a  hyperpiane  such  that  ax  >  0  for 
all  X  €  5.  We  make  the  following 

Claim  .  H  n  convS  =  conv(S  D  If). 
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Proof.  X  £  H  r\  conv5  if  and  only  H  ax  s  /}  and  x  =  y*A,-  for  some  finite  T  and 

y,-  €  S,Xi  >  0, «  €  T,  snch  that  “1^*  ^  ^  T.  But  ax  =  0  and 

*  €  T,  imply  ay*  =  /3,  i  €  T,  hence  x  €  H  convS  x  6  conv(5  n  H).  ■ 

Applying  this  result  to  (3)  and  using  the  fact  that  for  any  5,7'  €  3^,conv(conv5  U 

convT)  =  conv(5  U  T)  we  obtain 

=  conv(A'  n  f,.!  n  {x  :  X,  =  0})  U  (if  n  n  {x  :  x,  =  1})) 

=  conv(Jf  n  Ff-i  n  {x  :  x,  €  {0, 1}}) 

=  conv(iL’  n  7’,).  ■ 

Corollary  2.3  follows  immediately  from  Theorem  2.2  upon  substituting  p  for  t.  Another 
consequence  of  Theorem  2.2  is  the  following. 

Corollary  2,4  Pi(Pj(K))  =  Pj{Pi{K))y  for  iyj  €  {1, . . .  ,p},  *  j. 

2.2  The  Lovasz-Schrijver  Construction 

1.  Multiply  Ax  >b  with  xj  and  1  -  Xj,j  s  1, . .  .,p,  to  obtain  the  nonlinear  system 


(1 -x,)(Ax-{i)  >  0 
xi(Ax  -  6)  >  0 
(1  — xjXAi  — 6)  >  0 

X2(Ax  -  6)  >  0  (4) 

(l-Xp)(Ax-6)  >  0 

Xp(Ax  —  6)  >  0 

2.  Linearize  (4)  by  substituting  y,j  for  x,Xj,i  =  1, . . . ,  n,  j  =  1, . . . ,  p,  t  ^  j  and  xj  for  xj  , 
j  =  1, . .  .,p.  Call  the  polyhedron  defined  from  the  resulting  system  M{K). 

3.  Project  M{K)  onto  the  x-space  by  eliminating  y,-^,*  =  1, . .  .,n,j  =  1, . .  .,p,i  j.  Call 
the  resulting  polyhedron  N{K). 


6 


Note  that,  if  the  system  defining  K  has  m  constraints  and  n  variables,  of  which  p  are  0-1 
constrained  in  K°,  the  system  defining  M(K)  has  2pm  constraints  and  pn  +  n  -  p  variables. 
Lovdsz  and  Schrijver  have  shown  that  N(K)  has  the  following  properties: 

Theorem  2.5  N{K)  C  eonv(K  fl  {x  €  :  Xj  €  {0, 1}})  ,  j  =  1, . .  .,p. 

Let  N\K)  =  NiK)  and  N\K)  =  N(N*-\K)),  for  t  >  2. 

Theorem  2.6  N^{K)  =  amv  AT®. 

In  other  words,  iterating  the  above  procedure  p  times  yields  the  integer  hull. 

Theorem  2.1  implies  Theorem  2.5,  since  N{K)  C  Pj{K). 

Corollary  2.3  impUes  Theorem  2.6.  Again,  this  follows  from  N{K)  C  Pj{K),j  —  1, . . . ,p. 

Note  however,  that  the  Lovisz  and  Schrijver  relaxation  N{K)  is  not  only  stronger  than 
PjiK)  for  any  j,  but  also  stronger  than  the  inclusion  N(K)  C  d^-iPjiK)  can  be 

strict. 


2.3  The  Sherali  -  Adams  Construction 

Somewhat  earlier  than  Lovasz  and  Schrijver,  Sherali  and  Adams  had  proposed  a  similar 
convexification  procedure  [SA88]. 

Let  K  and  be  defined  as  above,  and  let  t  €  {1, . .  .,p}. 

1.  Multiply  Ax  >b  with  every  product  of  the  form  [fljeJi  ®i]  [nj€Ja(l  “  *i)]  »  *^1 

and  J2  are  disjoint  subsets  of  {1, . .  .,p}  such  that  |  Ji  U  J3I  =  t.  Call  the  resulting  non¬ 
linear  system  (NLi). 

2.  Linearize  {NLt)  by  (t)  substituting  Xj  for  Xj;  and  (it)  substituting  a  variable  wj  for 
every  product  IljeJ®'*  where  J  C  {l,...,p},  and  vjk  for  every  product 

where  J  C  { 1 , . . . ,  p}  and  k  €  {p  + 1, . . . ,  n} .  Call  the  polyhedron  defined  by  the  resulting 
system  Xt. 
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S.  Project  Xt  onto  the  x-space  by  eliminating  all  wj  and  vjk.  Call  the  resulting  polyhedron 

Kt. 


It  is  easy  to  see  that  K^CKpC...cKiCK.hi  addition,  Sherali  and  Adams  proved 
the  following: 

Theorem  2.7  [5^488,  SA89]  Kp  =  eonvK^. 

Next  we  prove  a  result  which  shows  that  Theorem  2.7  also  follows  from  Theorem  2.2. 
Theorem  2.8  For  <  =  1, . . .  ,p,  /ft  C  . 

Proof.  Let  A^x  >  denote  the  linear  system  describing  Pi,...j{K),  for  j  =  l,...,t.  Let 
az  >  be  one  of  the  inequalities  defining  Pi,..^i{K).  Then  ai  >  can  be  obtained  by 
taking  a  nonnegative  linear  combination  of  the  inequalities  (1  -  xt){A*~^x  —  b*~^)  >  0  and 
—  6*"*)  >  0,  with  multipliers  that  eliminate  the  nonlinear  products  /  t 

and  substituting  x^  by  *«.  By  the  same  argument  every  inequality  of  the  system  A*~^x  — 
b*~^  >  0  can  be  obtained  by  taking  a  nonnegative  linear  combination  of  ihe  inequalities 
(1  —  —  6*“*)  >  0  and  >  0,  with  multipliers  that  eliminate  all 

products  ^  t  —  \  and  setting  Z|_}  =  «t-i*  By  inductivdy  repeating  this  argument 

we  can  obtain  az  >  in  terms  of  the  inequalities  of  INLt),  by  first  substituting  Xj  by 
j  —  1? •  •  a>nd  then  eliminating  the  remaining  nonlinear  terms  using  as  multipliers  the 
product  of  the  multipliers  used  in  each  step  of  the  induction.  Therefore  ax  >  fits  valid  for 
Kt  and  the  result  follows.  ■ 

Now  Theorem  2.7  follows  from  Corollary  2.3  and  Theorem  2.8.  It  also  follows  from 
Theorem  2.6  and  a  proposition  in  [LS89]  that  shows  that  Kt  C  N*{K). 

2.4  The  Connection  with  Disjunctive  Programming 

The  results  of  Section  2.1  are  closely  related  to  results  obtained  earlier  in  the  context  of 
disjunctive  programming,  i.e.  optimization  over  unions  of  polyhedra.  The  first  of  these  is  the 
following  basic  lifting  theorem  for  unions  of  polyhedra: 
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Theorem  2.9  [B74b,B85]  Let  Hi  {z  €  S2"  :  A*x  >  6'},t  e  Q  be  a  finite  set  of  nonempty 
polyhedra.  Then  conv(U{eQni)  is  the  set  of  those  z  €  S?”  for  which  there  exist  vectors 
(y'»»o)>*  €  Q,  such  that 

*  -IZi€«y‘  =  0 

MV  -U  >  0 

Vi  >  0 

Vo  —  ^ 

Theorem  2.9  assumes  11,-  0,  t  €  Q.  If  =  0  for  some  k  €  Q,  the  theorem  is  still  valid 

[B85]  if  the  following  regularity  condition  holds: 

j4*z  >  0  =>  z  =  ^  y*  for  some  t  €  Q\{fc}  such  that  H,-  ^  0  and  j4*y‘  >  0. 


Next  we  show  that  Theorem  2.9,  specialized  to  the  case  when  |Q|  =  2  and 
•  Hi  :=  A"  n  {z  :  Zj  =  0},  112  :=  A  n  {z  :  =  1} 

yields  Theorem  2.1.  Indeed  in  this  case  (5)  becomes 


z  -z 

-y 

s 

0 

Az 

—bzo 

> 

0 

= 

0 

Ay 

-ho 

> 

0 

Vi 

-yo 

=: 

0 

20 

+yo 

= 

1 

2o,yo 

> 

0 

It  is  easy  to  see  that  the  above  regularity  condition  is  satisfied  for  (6),  since  the  matrices 
A*  associated  with  z  and  y  are  the  same,  namely  A. 

On  the  other  hand,  Pj(K)  is  the  projection  onto  the  z-space  of  Mj{K),  the  polytope 
whose  defining  system  is  obtained  from  (1)  by  setting  y,-  :=  z,Zj,t  =  1, . . .,n,yj  :=  Xj  =  z^. 
The  result  of  these  operations  is: 

Ax  ~b  —  Ay  +  bxj  >  0  (7) 

.4y  -  bxj  >  0 
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If  we  now  define  jto  :=  Xj,z  :=  x  —  y,2o  :=  1  —  xj,  (7)  can  be  rewritten  as  (6),  where  the 
equation  Zj  =  0  follows  from  yj  =  Xj,Zj  =  xj  —  yj;  and  the  equation  yj  =  Vo  follows  from 

yj  =  Xj  =  Jfo. 

Since  Theorem  2.1  asserts  that  conv(IIi  U  IIj)  is  the  projection  on  the  x-space  of  the 
polytope  defined  by  (6),  it  is  a  specialization  of  Theorem  2.9  to  this  case. 

An  alternative  characterization  of  the  convex  hull  of  a  union  of  polyhedra,  also  obtained 
in  the  context  of  disjunctive  programming,  is  contmned  in  the  following  theorem,  which  will 
play  an  important  role  in  the  cutting  planie  algorithm  of  Section  3.  We  state  the  result  as  it 
applies  to  Pj(A')  (=  conv(ni  U  IIj)). 

Theorem  2.10  [B746,P79] 

PjiK)  =  {x  €  :  oti  >  /?  for  all  (o,/J)  €  P/ (A)}, 

where  Pf{K)  is  the  set  of  those  (a,;8)  €  for  which  there  exist  vectors  u,v  e 
and  uo,  «o  €  R  satisfying: 


a  -uA  -uo€j 

s 

0 

a 

—vA 

-voej 

0 

ub 

> 

0 

vb 

+Vo 

> 

0 

u,v 

> 

0 

where  Cj  is  the  unit  vector  in  R". 

Further,  if  K  is  a  full  dimensional  polyhedron  and  II]  7^  0  ^  IIj,  there  is  a  1-1  correspon¬ 
dence  between  facets  of  PjiK)  and  extreme  points  of  PJ{K)0,  the  polyhedron  obtained  from 
the  cone  Pf{K)  by  setting  0  =  1  or  0  =  -1. 

Next  we  turn  to  the  sequential  convexification  theorem  for  facial  disjunctive  sets.  If  IT  is 
a  polyhedron  containing  the  polyhedra  Ili,t  €  <?,  then  the  disjunctive  set  S  :=  Uiegll,'  can 
be  written  in  conjunctive  normal  form  as 

S  =  {xen:  V  d‘x  >dS,h=  1,...,?},  (9) 
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where  |QaI  =  IQ  I  for  aU  h,  and  each  disjunction  h  contains  exactly  one  inequality  from  the 
system  defining  each  11,-. 

The  disjunctive  set  S  is  called  facial  if  each  inequality  d'‘x  >  dQ,k  €  Qhjf*  = 
defines  a  face  of  II. 

Theorem  2.11  [B7Ab,B79]  Let  S  be  defined  by  (9),  Let  Sq  :=  11  and  for  h  =  let 

Sh  :=  conij(Sfc_i  n  >  dj}). 

If  S  is  facial,  then  5,  =  conv  5. 

When  S  is  of  the  form 

K°  :=  {x  ^  K  :  Xh  =  OV  Xh  =  l,h- 
it  is  clearly  facial,  and  thus  Theorem  2.11  specializes  to  Theorem  2.2. 

While  faciality  is  a  sufficient  condition  for  the  theorem  to  hold,  it  is  not  necessary.  A 
necessary  condition  was  given  in  (BTT89]. 

We  will  say  that  an  inequality  ax  >  valid  for  A®,  has  disjunctive  rank  r  if  r  is  the 
smallest  integer  such  that  there  exists  a  subset  of  {l,...,p),  such  that  az  >  is 

valid  for  . 

In  the  case  of  a  pure  0-1  programming  problem  it  is  interesting  to  compare  the  disjunctive 
rank  of  an  inequality  with  its  Chvatal  rank. 

Let  A  =  {z  e  :-2zi  +  Z2<0,  2zi  +  *2<2,  0  <  Xj<l,j  =  1,2},  with  p  =  2. 

It  follows  from  Theorem  2.1  that  the  inequality  zj  <  0  has  disjunctive  rank  1,  but  it  is 
easy  to  verify  that  it  has  Chvatal  rank  2. 

Since  the  disjunctive  rank,  according  to  Theorem  2.2,  never  exceeds  the  number  of  vari¬ 
ables,  whereas  no  such  upper  bound  is  known  for  the  Chvatal  rank,  one  might  expect  the 
disjunctive  rank  to  always  be  less  than  or  equal  to  the  Chvatal  rank.  This,  however,  is  not 
the  case,  as  will  be  illustrated  in  the  next  section. 
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2.5  Application  to  the  Stable  Set  Polytope 

A  stable  set  (independent  set,  vertex  packing)  in  a  graph  G  :=  {V,E)  is  a  subset  S  C  V 
such  that  no  two  vertices  of  5  are  adjacent.  The  stable  set  polytope  is  the  convex  hull  of  the 
incidence  vectors  of  stable  sets  in  G  : 

S{G)  :=  conv{a:  €  {0, 1}"  :  Xi  +  Xj  <  1,  V(i,  j)  €  E} 

The  linear  programming  relaxation  of  S{G) 

FSiG)  :=  {*€»!{.:  *.  +  <  1,  V(x,i)  €  £}, 

sometimes  called  the  fractional  stable  set  pol}rtope,  strictly  contains  5(G)  whenever  G  is  not 
bipartite.  Facets  of  S(G)  include  the  odd  hole  and  clique  inequalities.  For  W  C  V,G\^ 
denotes  the  subgraph  of  G  induced  by  For  r  €  V,r(t))  denotes  the  set  of  vertices 

adjacent  to  v.  Given  G,  deleting  v  and  contracting  v  are  defined  as  replacing  G  with  G\{v} 
and  with  G\({v}  U  r(v)),  respectively.  Clearly  these  operations  correspond  to  setting  Zv  =  0 
and  =  1  in  5(G). 

If  ax  <  6  is  a  valid  inequality  for  5(G)  we  say  that  the  inequalities 

OjXj  <  b  and  ajXj  <  6  -  o„ 

j€V\{v}  i€V\({„}ur(v)) 

are  obtained  from  ax  <  6  by  the  deletion  and  contraction  of  v,  respectively. 

The  following  two  properties  are  shown  by  Lovisz  and  Schrijver  [LS89]  to  hold  for  the  set 
N(FSiG)).  Here  we  show  them  to  also  hold  for  the  larger  sets  Pj{FS{G)). 

Lemma  2.12  If  ax  <  b  is  a  valid  inequality  for  5(G)  and  there  exists  j  €  V  such  that  the 
inequalities  obtained  from  ax  <b  by  deletion  of  j  and  contraction  of  j  are  valid  for  FS{G\{j}) 
and  FS{G\({j}  U  r(j))),  respectively,  then  ax  <  b  is  valid  for  Pj{FS{G)). 

Proof.  Follows  from  the  fact  that  (Theorem  2.1)  Pj{FS{G))  =  conv(F5(G)  n  {x  :  Xj  € 

{0,1}})  ■ 

For  the  purpose  of  this  discussion,  an  odd  hole  in  G  is  defined  as  a  chordless  cycle  of  odd 
length  (i.e.  triangles  are  included). 
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Theorem  2.13  Let  C  CV  induce  an  odd  hole  in  G.  Then  the  odd  hole  inequality 


< 


|C|-1 

2 


u  valid  for  Pj{FS{G))  for  any  j  €  C. 


Proof.  Let  j  €  C.  The  inequalities  obtained  from  the  odd  hole  inequality  by  deleting  and 
contracting  j  are  valid  for  FS{G\{j})  and  FS(G\{{j}  U  r(j)))  respectively,  since  the  sub¬ 
graphs  of  G  induced  by  C\{j}  and  C\({i}  U  r(j)))  are  both  bipartite.  Hence  from  Lemma 
2.12,  the  odd  hole  inequality  is  valid  for  Pj(FS(G)).  ■ 

Thus  the  odd  hole  inequalities,  which  have  Chvatal  rank  1,  also  have  disjunctive  rank  1. 


CoroUsiry  2.14  All  odd  hole  inequalities  are  valid  for  the  polytope 

PiFS{G))-f)Pj{FSiG)). 

i€V 


Proof.  Follows  from  Theorem  2.13.  ■ 

As  mentioned  in  Section  2.2,  the  Lovasz  and  Schrijver  relaxation  N(K)  can  be  stronger 
than  the  intersection  of  the  relaxations  Pj{K).  In  our  case,  this  would  imply  that  the  inclusion 
N{FS{G))  C  P{FS{G))  is  strict.  This,  however,  is  not  the  case;  i.e.  for  the  stable  set 
problem  the  two  relaxations  are  the  same.  Lovasz  and  Schrijver  [LS89]  have  characterized 
N{FS{G))  as  precisely  the  polytope  defined  by  the  inequalities  defining  FS{G)  and  the  odd 
hole  inequalities. 

Proposition  2.15  P(FS(G))  =  N(FS(G)). 

Proof.  The  inclusion  N{FS(G))  C  P(FS(G))  has  already  been  discussed.  The  inclusion 
P{FS{G))  C  N{FS{G))  follows  from  the  fact  that  P(FS(G))  satisfies  all  the  odd  hole  in¬ 
equalities.  ■ 

Another  well  known  class  of  valid  inequalities  for  the  stable  set  polytope  is  that  associated 
with  cliques,  i.e.,  the  sets  of  pairwise  adjacent  vertices. 
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The  clique  ^.equality 


j€K 

where  K  CV  is  a.  clique  of  G,  is  known  to  induce  a  facet  of  S(G)  if  and  only  if  the  clique  K 
is  (inclusion-)  maximal.  Clique  inequalities  are  known  to  have  Chvital  rank 

Theorem  2.16  For  any  clique  K,  the  clique  inequality  Yii^K  ^  ^  disjunctive  rank 
\K\-2. 


Proof.  First  we  show  by  induction  that  the  rank  of  <  1  is  at  most  |^|  —  2.  For 

lA*!  =  3  the  result  follows  from  Lemma  2.13.  Now  suppose  the  result  holds  for  every  clique 
K  such  that  \K\  <  k  and  let  K*  be  a  clique  of  size  Jb  -1- 1.  Let  j  €  K'  and  K  =  K'\{j}.  By 
the  inductive  hypothesis,  the  inequality  ^  1*  obtained  from  SIteK’'  ^  ^  deletion 

of  j,  has  rank  at  most  k,  i.e.  there  exists  C  V  such  that  the  inequality  is  valid 

for  Pii,...,i,,(FS(G)).  Also,  the  inequality  obtained  from  *•  ^  1  ^y  contraction  of  j  is 

0  <  0  and  also  valid  for  Pii,...,ii,(FS(G)).  Hence,  by  Theorem  2.1, S  1  is  valid  for 
j(f’5(G))  and  hence  the  disjunctive  rank  of  ]C«eK'  *»  <  1  is  at  most  k  +  1  =  \K'\. 

To  prove  that  the  disjunctive  rank  of  <  1  is  exactly  1A|  -  2,  suppose  it  is  s  < 

|A|-3.  Then,  there  exists  {»i, . .  • ,  t,}  C  V  such  that  the  inequality  is  valid  for  ,aFS{G)). 
It  then  foUows  from  Theorem  2.1  that,  whether  €  A  or  t,  ^  A,  the  inequality 

i€K\{i.} 

is  valid  for  Pi,,...,i,_i(F5(G\{t,})).  Applying  this  reasoning  recursively  to  A\{t,},  A\{»„ 
etc.,  the  inequality 

with  |A\{ii,. . >  3  is  valid  for  F5(<?\{ii,...,»,}),  a  contradiction.  ■ 

Thus  the  disjunctive  rank  of  cliques  inequalities  for  cliques  of  size  >  5,  is  larger  than  their 
Chvdtal  rank. 
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3  Some  Cutting  Plane  Algorithms 


3.1  The  General  Procedure 

In  this  section  we  discuss  cutting  plane  algorithms  for  mixed  0-1  programs  based  on  the 
sequential  convexification  procedure  of  Section  2.1.  In  particular,  we  address  the  problem 

mtn{cz  :x  e  K,Xje  {0,1},  j  =  l,...,p},  (MIP) 

where,  as  before,  =  {i  €  R"  :  A*  >  6}. 

We  wish  to  use  facets  of  Pj{K)  as  cutting  planes.  For  this  purpose  we  will  generate 
inequalities  ax  >  ^  such  that  (a,^)  is  an  extreme  ray  of  the  cone  Pj{K)  of  Theorem  2.10. 
This  can  be  done  by  solving  a  linear  program  of  the  form 

Tnax{aa  + b0  :  {a,/})  £  Pj(K)n  S},  (10) 

where  (a,  6)  6  is  a  vector  that  determines  the  direction  of  the  cut,  Pj{K)  is  the  polyhe¬ 

dral  cone  defined  by  (8),  while  5  is  a  “normalization”  set  defined  by  one  or  more  constraints 
meant  to  truncate  the  cone  Pj{K). 

The  general  outline  of  such  a  procedure  is  as  follows: 


0.  /  :=  l.K^  :=  /r  =  {i  €  »"  :  Ax  >  6). 

1.  Find  cx*  m»n{cx  :  x  e  K*}. 

//  xj  €  {0, 1}  /or  j  =  1, . . . ,  p,  stop. 

2.  Forj  e  {1, . .  .,p}  such  that  0  <  x*-  <  1,  find 

:=  7nax{a*o  -|-  b*/3  :  €  Pj{K*)  n  5). 

3.  Define  by  adding  to  the  constraints  of  if*  the  cuts  afx  >  0^  generated  in  Step  2 
(and  perhaps  removing  some  cuts  added  earlier). 

4.  Set  t  :=  t  -f  1  and  go  to  1. 
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There  are  several  options  for  choosing  the  set  S  and  the  vector  (a*,  6^)  in  Step  2. 

Normalization  1  :  We  say  that  (a,/3)  €  Pj(K*)  defines  a  deepest  cut  if  it  maximizes  the 
(Euclidean)  distance  between  x*  and  the  hyperplane  ax  =  0. 

Maximizing  the  distance  between  x*  and  ax  =  0  is  the  same  as  maxti-.izing  the  distance 
between  x*  and  its  orthogonal  projection  on  ax  =  0,  which  isx  =  x*^Xa  for  some  A  >  0.  Since 
ax  =  0  is  the  same  as  Aoz  =  X0  w.l.o.g.  we  can  take  A  =  1.  Thus  x  —  x*  =  a.  Furthermore, 
0  —  ax*  —  ax  —  ax*  =  oo.  Thus  a  deepest  cut  is  obtained  for  the  vector  {o,*,b*)  =  (— **,1) 
and  the  set  S  :=  {(o!,/3)  :  0  —  ax*  =  ad}.  However,  the  resulting  problem  (10)  is  not  a 
linear  program,  as  the  equation  defining  S  is  quadratic.  Thus  we  are  led  to  consider  some 
alternatives  to  this  "optimal”  normalization. 

We  continue  to  use  (a*,b*)  =  (—*‘,1)  so  that  the  objective  in  (10)  remains  to  maximize 
the  amount  by  which  the  point  x*  violates  the  cut  ax  >  0.  We  consider  the  following  options 
for  5 ; 

Normalization  2  :  One  may  simply  require  that  /3  =  lor/9=:-l.  In  many  problems  it  is 
easy  to  decide  whether  one  should  want  a  cut  ax  >  0  with  /3  >  0  or  /?  <  0.  One  advantage 
of  this  approach  is  that  the  linear  program  (10)  to  be  solved  in  Step  2  can  be  reduced  by 
eliminating  the  variables  o,-,  *  =  1, . . .,«  from  the  system  (8)  defining  F-{K).  Thus  for  /?  =  1 
or  0  =  —\  ,  (10)  becomes 

Min  (u^  +  uoej)x* 
subject  to 

«i4  -vA  +(iio  -vo)ej 
ub 

vb  +U0 

u,  V 

On  the  other  hand,  the  drawback  of  this  formulation  is  that  the  optimal  solution  sought 
may  not  exist.  Indeed,  it  is  known  [B74b,  B79]  that  the  linear  program  (11)  has  a  minimum 
if  and  only  if  Xx*  €  /j(A')  for  some  A  >  0.  For  important  classes  of  problems  this  condition 
is  always  satisfied.  But  for  others  it  is  not,  and  the  task  of  generating  a  strong  cut  using  (11) 
becomes  cumbersome. 


=  0 

>  P 

>  ^ 
>  0 


(11) 
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The  next  two  normalizations  are  aimed  at  guaranteeing  the  existence  of  a  finite  optimum 
in  (10). 

Normalization  3  :  We  require  that  ||a||<„  <  1,  by  defining  5  :=  {(a,P)  :  -1  <  a,-  <  l,i  = 

Normalization  4  :  We  require  that  (|oj|,  <  1,  by  defining  S  :=  {(a,/?)  :  l“«l  ^  !}• 

The  absolute  value  constraint  used  here  can  be  linearized  by  introducing  2n  new  variables 
of ,  o,“ ,  i  =  1, . . . ,  n,  writing 

S  :=  {(o,/?) :  o  =  o+  -  a~;a+,a~  >  0;  +  o^)  <  1}, 

i=l 

and  eliminating  o. 

3.2  Finite  termination 

While  there  are  several  finite  algorithms  for  pure  integer  programming,  finiteness  is  much 
harder  to  achieve  in  the  mixed  integer  case.  Gomory,  who  proved  that  his  cutting  plane 
algorithms  for  pure  integer  programming  are  finitely  convergent,  was  able  to  prove  finite 
convergence  of  his  mixed  integer  programming  algorithm  [G60]  only  for  the  case  when  the 
objective  function  is  itself  integer  constrained.  However,  this  assumption  cannot  be  made 
without  loss  of  generality;  and  if  the  assumption  is  removed,  Gomory's  algorithm  is  not  finite, 
as  shown  by  White  [W61]  (see  (S75]). 

The  first  finite  mixed  integer  programming  cutting  plane  algorithm  was  developed  by 
Jeroslow  [J80]  in  the  context  of  facial  disjunctive  programming,  of  which  (MIP)  is  a  special 
case.  To  guarantee  finiteness,  Jeroslow  uses  a  game  theoretic  framework  for  choosing  the 
cuts  to  generate.  His  convergence  proof  is  based  on  the  sequential  convexification  theorem  of 
Section  2.4,  and  uses  the  fact  that  every  cutting  plane  generated  in  the  algorithm  is  a  facet 
of  some  member  of  a  finite  family  of  polyhedra. 

In  this  section  we  give  a  finiteness  proof  for  a  particular  version  of  our  procedure.  Although 
our  proof  is  simpler  than  that  of  Jeroslow,  it  uses  the  same  basic  idea. 

The  general  procedure  outlined  in  Section  3.1  need  not  be  finitely  convergent.  To  insure 
finite  convergence,  additional  details  have  to  be  specified.  We  start  with  some  notation. 
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In  a  general  iteration  of  the  procedure  of  Section  3.1,  the  current  polyhedron  K*  is  defined 
by  the  inequalities  o{Ax>b  together  with  a  set  of  cuts.  For  j  €  {1, . .  -ip},  a  cut  that  appears 
in  the  defiiution  of  K*  is  called  a  j-cut  if  it  was  generated  as  a  cut  for  some  Pj{.),  i.e.,  from 
the  disjunction  Zj  =  0  V  Zj  si  1.  Let  Kj  be  the  polyhedron  defined  by  i4z  >  b  and  all  i-cuts 
for  *  =  1, . . ., j,  with  Kq  =  K.  Note  that  with  this  notation  K^,  =  K*  and  K]  $  K\  for  aU  i, j 
such  that  0  <  t  <  j  <  p. 

Let  Pj(K)s  :=  Pj{K)  0  5,  where  S  is  the  set  defined  in  Normalization  3  or  4. 
Specialized  Cutting  Plane  Algorithm 


0.  t  :=  1,^:1  :=  if  =  {z  G  R"  :  Az  >  6}. 

1.  Find  cz*  :=  mtn{cz  :  z  €  A"*}. 

Vx*j  6  {0, 1}  for  j  =  1, . . .  ,p,  stop. 

2.  Let  i  €  {1, . .  .,p}  be  the  largest  index  such  that  0  <  zj  <  1.  Generate  a  j-cut  cPx  >  ^ 
by  solving 

max{0  -  az‘ :  (q,/3)  €  i^CA'j.Os). 

3.  Define  by  adding  the  j-cut  a^x  >  to  the  constraints  of  K*. 

4.  Set  f  :=  f  +  1  ond  go  to  1. 

Theorem  3.1  The  Specialized  Cutting  Plane  Algorithm  finds  an  optimal  solution  to  (MIP) 
in  finitely  many  iterations. 

Proof.  We  need  to  prove  two  clmms: 

(i)  The  inequality  o^z  >  /P  generated  in  Step  2  cuts  off  z‘. 

To  prove  this,  we  show  that  z*,  an  extreme  point  of  K*(=  is  also  an  extreme  point  of 
Kj.  This  is  trivially  true  if  j  =  p,  so  assume  that  j  <  p.  Since  z|,  G  {0, 1),  it  follows  that 
Zp  €  n  {z  G  S”  :  Zp  =  z‘ }  C  Fp(A'‘_j)  C  A*  and  therefore  z‘  is  an  extreme  point  of 
A‘_i  n  {z  G  5?"  :  Zp  =  z‘ },  hence  also  of  Ap_j  (  since  Ap_i  n  {z  G  S?"  :  Xp  =  z‘ }  is  a  face 
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of  By  induction,  aince  x{  €  {0, 1}  for  it  =  p,p  —  1, . . . ,  j  +  1  it  follows  that  2^  is  an 

extreme  point  of  K^^i,  A’*_2, . . . , 

Next  we  show  that  x*  Since  Pj(A'j_j)  C  A'j,  if  x*  €  PjiKj_i)  then  x*  is  an 

extreme  point  of  But  all  extreme  points  of  PjiKj_i)  have  a  j*^  component  equal 

to  0  or  1,  whereas  0  <  z'-  <  1. 

Since  x*  ^  the  inequality  o^z  >  /P  generated  in  Step  2  is  violated  by  z‘. 

(it)  For  j  =  1, . . . ,p  the  number  of  j-cuts  generated  by  the  algorithm  is  finite. 

We  prove  this  by  induction.  The  statement  is  certainly  true  for  j  =  1  as  every  1-cut  generated 
corresponds  to  an  extreme  point  of  Pi(Kq)s  =  P{(K)s,  of  which  there  are  only  finitely  many, 
and  as  shown  in  Claim  1,  every  1-cut  generated  cuts  off  some  z*  that  satisfies  all  1-cuts 
generated  earlier. 

Suppose  now  that  the  statement  is  true  for  all  t  =  l,...,j  —  1  and  let  i  =  j.  By  the 
induction  hypothesis,  the  set  A'j.j  is  redefined  in  Step  3  of  the  algorithm  (by  addition  of 
some  i-cut  for  t  €  {l,..-,j  -  1})  only  a  finite  number  of  times.  Between  any  two  such 
redefinitions,  only  a  finite  number  of  j-cuts  can  be  generated,  since  each  j-cut  corresponds  to 
an  extreme  point  of  Pf(Kj_i)si  of  which  there  are  only  finitely  many,  and  each  j-cut  cuts  off 
some  X*  which  satisfies  all  j-cuts  generated  earlier.  Hence  only  a  finite  number  of  ^-cuts  are 
generated  during  the  entire  algorithm,  which  completes  the  induction.  ■ 

3.3  Cuts  from  the  basis  inverse 

Let  X*  be  the  current  fractional  solution  in  the  iteration  of  the  cutting  plane  procedure,  i.e. 
cz‘  =  min{cx  :  x  €  if*}, if*  =  {z  €  3?"  :  j4‘z  >  6*}.  In  general,  for  a  cut  ax  >  in  Pj{K*) 
defined  by  (8),  the  variables  u,-  and  Vi  can  be  strictly  positive  even  if  the  corresponding 
constraint  ^  satisfied  as  equality  by  z*.  In  the  case  where  we  impose 

that  the  only  that  are  allowed  to  be  strictly  positive  are  those  for  which  the  slack 
corresponding  to  the  constrmnt  a\jXj  >  b*  is  nonbasic,  a  cut  can  be  obtained  without 
having  to  solve  a  linear  program,  as  shown  below: 

Let  H*  be  the  matrix  obtained  from  A*  by  keeping  only  the  rows  corresponding  to  con- 
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straints  for  which  the  associated  slack  variable  is  nonbasic.  Let  C  =  {z  €  :  B^x  >  d*} 

be  the  polyhedron  defined  by  those  inequlities  of  A*x  >  If  corresponding  to  the  rows  of  B*. 
Then,  by  applying  Theorem  2.10  to  Pj{C)  and  eliminating  the  variables  a,-,t  =  1, . .  .,n,  and 
£rom  the  corresponding  system  (8)  we  get: 

{v^-v^)B*  =  (t;^-u?)e, 

(tt® J>1  +  S2  =  »0  <12) 

tl®,0®,Si,S2  >  0 

where  si  and  S2  are  the  nonnegative  slack  variables  corresponding  to  the  inequalities  ub> 
and  vh+«o  >  ^  in  (8).  Since  the  matrix  B*  is  invertible,  given  u® — Uo  >  the  vector  z®  =  tt®— 1>® 
is  uniquely  determined  from  the  first  n  constraints  of  (12).  Moreover,  a  basic  solution  to  (12) 
satisfies  uf.vf  =  0  for  all  t,  which  implies  that  u®  and  v®  are  uniquely  defined  from  z® 
as  its  positive  and  negative  parts  respectively  (u®  =  (z®)"*",®®  =  (z®)“).  This  leads  os  to 
investigate  the  cuts  that  are  obtmned  by  taking  as  our  normalization  constraint  the  equality 
Vq  —  —  1.  These  cuts  are  uniquely  defined  from  (12)  except  for  the  values  of  si  and  S2. 

Whenever  si  =  S2  =  0  we  get  the  following  simple  formula  for  the  cut  ax  >  0  : 

«®  =  (Bj-‘)+ 
r®  =  (Bj-y 

a,  =  (Bj~^)'*'B*f  t  =  1, . . . ,  n,  i  j,  (13) 

=  (Bj'YB^  +  Byd^-l 

0  =  (^r‘)v 

where  denotes  the  j  —  th  row  of  B*~^  and  =  (J?j~')+  —  (5j“')“.  It  is  important  to 

point  out  that  the  j  -  th  row  of  the  matrix  B*~^  is  readily  available  from  the  simplex  tableau 
defining  the  solution  z*. 

Furthermore,  the  cut-hyperplane  ax  —  0  goes  through  the  points  where  the  boundary 
hyperplane  Zj  =  0  or  Zj  =  1  of  the  set  0  <  Zj  <  1  is  intersected  by  the  rays  of  the  cone 
with  apex  at  x*  defined  by  the  inequalities  in  C.  Therefore  the  cut  obtained  this  way  is  the 
intersection  cut  associated  with  the  convex  set  0  <  Zj  <  1.  Intersection  cuts  were  introduced 
by  Balas  [B71],  [B74a],  see  also  Glover  [G173]. 
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We  end  this  section  with  another  normalization  that  has  an  interesting  property.  Let 
K*,A*,b*,  and  x*  be  defined  as  in  the  catting  plane  procedure  of  Section  3.1. 

Normalization  5  :  We  require  that  vb  —  uo  =  1* 

It  can  be  shown  that  whenever  the  last  two  inequalities  of  (8)  are  required  to  hold  with 
equality,  the  unique  cut  obtained  by  imposing  this  normalization  is  the  intersection  cut  (13). 

3.4  Cut  Lifting 

In  this  section  we  show  that  cutting  planes  with  essentially  the  same  properties  as  those 
derived  by  the  procedure  of  Section  3.1  can  be  obtmned  from  a  smaller  linear  program  than 
the  one  over  Pj{K*)s,  by  working  in  the  subspace  defined  by  the  fractional  components  of 
X*,  and  then  lifting  the  inequality  into  the  ori^nal  space.  This  is  important  not  only  because 
it  is  a  computationally  cheaper  way  of  getting  essentially  the  same  cut,  but  also  because  in 
a  branch  and  cut  context  it  provides  a  way  of  generating  cutting  planes  at  one  node  of  the 
search  tree  and  lifting  them  into  cutting  planes  valid  at  every  node. 

Let  us  consider  the  LP  needed  to  generate  a  cut  ax  >  0  with  Normalization  2,  given  by 
(11).  Let  F  =  {t  €  {1,.-mP}  :0  <  z|  <  1}  U  {t  €  {p+  l,...,n}  :  >0).  W.l.o.g.  we  can 

assume  that  if  i  €{!,.••, n),  i  ^  F  then  i{  =  0,  since  for  those  i  such  x*  =  1  the  variable 
Xi  can  be  complemented  by  changing  the  sign  of  A,-,  the  column  of  A,  and  replacing  b  by 
b  -  Ai. 

Consider  the  problem  derived  from  (11)  by  removing  from  A  all  the  columns  corresponding 
to  {1, ... ,  n}\F.  Let  Ak  denote  the  column  of  A,  and  let  A^  be  the  column  obtained  from 
Ak  by  removing  the  components  (all  equal  to  0)  corresponding  to  the  inequalities  zf,  >  0,  h  ^ 
F  and  -Xk  >  -1  for  h  €  {1, . .  .,p}\F.  Then  (11)  can  then  be  rewritten  as: 
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+ “0*5 

subject  to 

tiAf  -vA^  =  0  keF\{j} 

uAf  ~vAf  +tio  -«o  =  0 

ub  >  0 

vb  -i-vo  >  p 

tt,  V  >  0 


where  0  s=  1  at  fi  ^  -1. 


(14) 


Note  that  some  of  the  variables  and  constraints  in  (11)  are  not  present  in  (14).  The 
constraints  that  are  missing  are: 

uAk  -  vAk  =  0,  F 

while  the  missing  variables  are  ti«+,  for  t  (f  F  and  Um+n+i  for  i  €  {!,. . .,p),*  <  F.  Here 
m  +  1, , . . ,  m  +  n  are  the  indices  associated  with  the  primal  constraints  i*  >  0,  /»  =  1, . . . ,  n 
and  m  +  n  +  l,...,m  +  n  +  pare  those  associated  with  the  primal  constraints  -Zk  >  -1, 
h  =  1, . .  .,p. 

The  following  theorem  shows  how  an  optimal  solution  to  (14)  can  be  extended  to  an 
optimal  solution  to  (11). 


Theorem  3.2  Let  be  an  optimal  stratum  to  (14).  Extend  to  a  solution 

(tt,u)  o/(ll)  by  defining: 


**»n+« 


®m+i 

^+n+< 


uf  forisl,.. 

.,m. 

v[  /orisl,.. 

.,m. 

if  v^A^  >  v^A^ 

fori^F, 

otherwise 

1  («^-i?'').4f 

ifu^^Al  > 

for  i^F, 

1  0 

otherwise 

tim+n+.=0  /or  i  €  p},i<  F. 


Then  (u,v)  is  a  basic  feasible  optimal  solution  to  (11). 
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Proof.  By  construction  (u,  v)  satisfies  all  the  constraints  of  (11)  missing  from  (14),  while  the 
remaining  constraints  are  not  affected.  Thus  (u,v)  is  feasible  for  (11). 

To  see  that  (u,v)  is  basic,  note  that  (u^,v^)  is  a  basic  solution  to  (l4),  and  (tZ,v)  con¬ 
tains  exactly  one  extra  positive  component  for  every  constraint  of  (11)  missing  from  (14). 
Furthermore  note  that  the  missing  constraints  are  affinely  independent  from  each  other  and 
from  the  constraints  of  (14),  so  their  addition  to  (14)  increases  the  rank  of  the  latter  exactly 
by  their  number.  Thus  (2,u)  is  basic  for  (11). 

Now  let  be  the  optimal  solution  to  the  dual  of  (14)  associated  with  (u^,v^).  Ebctend 
to  a  feasible  solution  to  the  dual  of  (11),  by  setting  to  0  all  components  associated  with 
those  constraints  of  (11)  missing  from  (14).  Then  the  reduced  costs  of  (14)  remain  unchanged 
in  (11).  As  to  the  reduced  costs  of  the  variables  of  (11)  missing  from  (14),  the  situation  is  as 
follows: 

For  the  variables  Sm+n-ft  for  t  €  {1, . .  .,p},  *  0  F,  the  reduced  cost  is 

-*fc  -  (since  ij^  =  0) 

where  z^  is  the  dual  variable  associated  with  the  next  to  last  constrmnt  of  (11).  Since  that 
constraint  is  an  inequality,  z^  >  0. 

For  the  variables  Vto+h+i  for  j  €  {1, . .  .,p},J  ^  F,  the  reduced  cost  is 

0  -  (-2^)  =  2^,  >  0, 

where  z^  is  the  dual  variable  associated  with  the  last  inequality  of  (11). 

For  the  variables  Um+i  for  i  €  {l,...,n},t  ^  F,  the  reduced  cost  is 

-  (0)  =  0  (since  x\  =  0) 

whereas  that  associated  with  the  variables  for  i  G  .  .,n},i  ^  F,  the  reduced  cost  is 

0  -  (0)  =  0. 

Thus  all  the  reduced  costs  are  nonnegative  and  hence  (u,v)  is  optimal  for  (11).  ■ 

Theorem  3.2  does  not  carry  over  directly  to  Normalizations  3  and  4.  However,  it  '•.c.n  be 
proved  for  the  following  variants  of  these  normalizations. 
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Normalization  3' :  5  =  {{a,0) :  -1  <  or.  <  1,  for  i  €  F). 

Normalization  4' :  5  =  {(o,/3)  ;  1®»I  ^  !}• 

3.5  Cut  Strengthening 

The  cutting  planes  ax  >  0  in  Pj{K)  for  some  j  €  {1, .  •  ..p}.  he  strengthened  by  using 
the  integrality  condition  on  variables  oth^r  than  Zj,  as  shown  by  Balas  and  Jeroslow  [BJ80]. 

Consider  the  system  (8)  of  Section  2,  defining  P^(K).  If  we  separate  the  non-negativity 
constraints  z  >  0  from  the  rest  of  the  inequalitiec  in  ^z  >  h,  i.e.  write  the  system  without 
z  >  0  as  Az  >  ft,  z  >  0,  then  (8)  becomes 

a  -uA  —tLotj  >  0 

a  — wA  — voCj  >  0 

SS  >  0  (15) 

vb  -i-uo  >  0 

u,v  >  0 

and  the  coefficients  of  the  cut  ax  >  0  can  be  written  as 

Ofc  =  maz|o:J,  ot|},  *:  = 

0  =  min{0i,02} 

where 

al  =  uAfc,  al  =  vAk,  for  fc  =  1, . .  .,p,fc  ^  j; 

oj  =  uAj  -  oo,  Qj  =  vAj  +  Wo 

(with  Ak  denoting  the  kth  column  of  A)  and 

0^  =  25,  =  w5  -I-  Wo. 

The  cutting  plane  ax  >  0,  with  0^0,  can  be  strengthened  to  7Z  >  where  0  is  defined 
as  above,  while  7  is  given  by; 
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7* 


(16) 


=  +  for  fc  = 


7*  =  for*  =  P  +  1,... ,n; 


with 


m-k 


tlo(^2|  +  «o|/?M 
For  the  validity  of  this  streagthening  see  [BJ80,  B79]. 


(17) 


Note  that  if  =  0  the  coefficient  7/^  in  the  strengthened  cut  will  be  the  same  as  in  the 
unstrengthened  cut.  It  can  be  shown  that  this  is  the  case  for  all  components  k  such  that 
=  1  when  =  /3^,  a  frequently  occurring  case  in  practice.  By  complementing  the  variable 
Xk  and  then  applying  the  strengthening  procedure,  we  may  get  mii  ^  0.  In  our  computational 
experiments  we  followed  this  practice. 

We  also  note  that,  while  a  linear  transformation  of  the  system  Ax  >b  leaves  K,  Pj{K) 
and  its  facets  unchanged,  it  can  change  the  effect  of  the  strengthening  procedure. 


Suppose,  for  instance,  that  instead  of  applying  the  disjunction  Xj  =s  0  V  xj  %  1  to  the 
system  Ax  >  6,  we  first  solve  the  linear  program  min{cx  :  Ax  >b}  to  obtain  for  .4x  >  6  the 
expression: 


=  0.0 + 53  °«*(-®*) 
keJ 

>  0,  keiuJ, 


for  » €  / 


(18) 


where  /,  J  index  the  basic  and  nonbasic  variables  respectively,  and  then  restate  (18)  as 

-  dikXk  >  -a,o,  t  e  /  (19) 

fc€J 

Xk  >  0,  k£j. 


Suppose  also  that  j  €  /;  then  the  disjunction  Xj  =  0  V  xj  =  1  becomes 

53  ^  H  ^  ^  (20) 

k€J  k€J 

Now  the  cuts  defined  from  (19),  (20)  will  be  expressed  in  terms  of  the  variables  Xk,  k  £  J\ 
and  although  these  cuts  are  equivalent  to  the  ones  obtained  from  Ax  >b,Xj  =  Qv  Xj  =  1, 
when  it  comes  to  the  strengthening  procedure,  the  outcome  will  in  general  be  different  in  the 


two  cases.  First  of  all,  in  one  case  the  strengthening  procedure  can  be  applied  to  all  coefficients 
whereas  in  the  other  case  only  to  the  coefficients  Ofc  for  k  €  {1, 

Second,  the  different  numerical  values  may  yield  different  strengthening  parameters  m*  in  the 
two  cases. 

One  reason  to  look  at  the  family  of  strengthened  cuts  derived  from  (19),  (20)  is  that  the 
Gomory  cut  for  mixed  integer  programming  (G60]  is  a  member  of  this  family.  To  see  this, 
it  suffices  to  look  at  the  general  form  of  the  above  cut  and  assign  some  special  values  to  the 
multipliers  used  in  its  derivation.  To  do  this,  we  have  to  restate  the  system  (15)  defining 
Pj{K)  in  terms  of  the  variables  used  in  (19),  (20): 


Ok  ® 

a*  +wod>fc  >  0 

+UofljO  -  P 

+»o(l-djio)  >  /? 


far  k^  J 
for  keJ 


As  before,  we  can  write  the  cut  as  ai  >  /3,  with  a*  =  max  and  0  —  m»n  {0iy02) » 

where 


a*  =  —  +  tioOjfc,  Ofc  —  —  ICtg/ k£j 

0>  ■=■  -Eig/ti-aio  +  ttoUjo,  0^  = 

The  strengthened  cut  then  becomes  71  >  ^  where  7fc  is  defined  by  (16),  (17). 

Theorem  3.3  The  mixed  integer  Gomory  cut  [G60]  is'yx  >  ■^  with  the  choice  of  multipliers: 


ttj  =  0,  v}  =  0,  *  €  /, 

uo  =  l/djo,  »o  =  1/(1 -Ojo)- 

Proof.  Using  the  multipUers  defined  in  the  theorem  we  obtain: 

ol  =  Ojk/ajo,  a*  =  *"6jfc/(l  ”  Oio)i  k^J 
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and  01  =  13^  =  Substituting  these  values  into  (16)  and  (17)  yields  mt  =  — dyjb  and 

7fc  =  min|°^*^J  for  fe  =  l,...,p 

(.  <*>o  (l-«io)  ) 

7fc  =  max|^,  -— 1*  -1  forfc  =  p+l,...,n 
(ojo  (l-aio)J 

which  is  the  mixed  integer  cut  of  [G60].  ■ 

4  Computational  Experience 

Preliminary  versions  of  the  cutting  plane  procedure  discussed  in  Section  3.1,  with  Normaliza¬ 
tions  2,  3,  and  4,  were  tested  ou  several  classes  of  problems.  At  every  iteration,  Step  2  was 
applied  to  every  j  6  {1, . .  .,p}  such  that  0  <  ij  <  1;  i.e.,  a  cut  was  generated  for  every  0-1 
variable  that  took  a  fractional  value  in  the  optimal  solution  to  the  linear  program  solved  in 
Step  1.  The  cuts  were  generated  in  the  subspace  defined  by  the  fractional  variables  and  then 
lifted  with  the  procedure  of  Section  3.4  into  the  full  space.  The  strengthening  procedure  of 
Section  3.5  was  then  applied  to  every  cut  generated.  The  strengthened  cuts  were  considered 
in  the  order  of  the  amount  by  which  they  were  violated  by  the  current  solution  (most  vio¬ 
lated  first),  and  a  cut  was  added  to  the  constraint  set  if  the  cosine  of  the  angle  between  its 
normal  vector  and  that  of  all  previously  added  cuts  differed  by  at  most  6  <  1,  where  6  is  a 
parameter  chosen  by  the  user  (Here  we  took  6  =  0.999).  Similarly,  the  remaining  cuts  were 
considered  one  at  a  time  starting  with  the  most  violated  one.  Each  cut  was  compared  with 
all  previously  added  cuts  for  this  iteration,  using  the  parameter  9  to  decide  whether  to  add 
it  to  the  formulation.  The  experiments  were  run  for  a  maximum  of  30  iterations,  and  if  the 
objective  function  failed  to  improve  significantly  over  several  consecutive  iterations,  the  run 
was  stopped  earlier. 

The  linear  programs  encountered  during  the  procedure  were  solved  using  the  CPLEX 
library.  For  the  purpose  of  benchmarking  and  comparison,  the  test  problems  were  also  solved 
with  a  branch  and  bound  code  (LINDO  for  most  of  the  problems,  Carpaneto  and  Toth’s 
[CT80]  for  the  TSP’s),  as  well  as  with  a  procedure  using  Gomory’s  cutting  planes.  For  better 
comparability,  the  Gomory  cuts  were  used  in  the  framework  of  our  procedure;  i.e.  in  Step 
2  of  our  procedure,  instead  of  generating  one  of  our  cuts  for  each  j  €  {1,. .  .,p}  such  that 
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0  <  Zj  <  1,  a  oiixed  integer  Gomory  cut  was  generated  from  each  row  of  the  simplex  tableau 
corresponding  to  a  (basic)  variable  xj  such  that  0  <  z*-  <  1.  As  with  our  procedure,  we  used 
the  parameter  $  to  decide  which  cuts  to  add  to  the  formulation.  The  tests  were  run  on  a  SUN 
Sparcstation  330. 

We  considered  four  different  classes  of  test  problems.  The  first  one  is  a  set  of  unstructured 
0-1  programs  where  Normalization  2  does  not  apply.  The  set  is  used  to  compare  Normal¬ 
izations  3  and  4  with  Gomory  cuts.  The  second  class  is  a  set  of  randomly  generated  vertex 
packing  problems  where  we  compare  Normalization  2  with  Gomory  cuts.  The  third  class  are 
fixed-charge  network  problems  formulated  as  mixed  0-1  programs.  The  last  class  consists  of 
two  real  world  asymmetric  TSP’s  where  our  algorithm  is  compared  with  a  problem  specific 
branch  and  bound  algorithm. 

The  first  class  of  test  problems  consists  of  a  set  of  pure  0-1  programs  with  fairly  large 
integrality  gaps  (i.e.,  differences  between  the  value  of  the  LP  and  IP  optima).  The  BM 
problems  are  tightly  constrained  general  0-1  programs  with  positive  and  negative  coefficients, 
randomly  generated  by  Bouvier  and  Messounaian  [BM65].  The  LSB  and  LSC  problems  have 
a  real  world  origin  and  are  taken  from  Lemke  and  Spielberg  [LSp67].  The  PE  problems  are 
capital  budgeting  (multiple  knapsack)  models,  with  all  positive  coefficients,  originating  with 
Peterson  [PE67].  All  of  these  problems  are  also  described  in  [BMaSO].  The  CJP  problems  are 
from  Crowder,  Johnson  and  Padberg  [CJP80]  and  have  a  real  world  origin.  Table  1  describes 
the  problems  by  giving  the  number  of  their  variables  and  constraints,  the  value  of  the  LP 
optimum  and  the  integer  optimum,  and  the  number  of  search  tree  nodes  it  took  LINDO’s 
branch  and  bound  code  to  solve  them. 
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Problem 

name 

Number  of 

construnts 

Number  of 

variables 

Value  of 

LP  optimum 

Value  of 

IP  optimum 

Branch  k  bound 

tree  nodes 

BM13 

15 

15 

14.96 

26 

26 

BM14 

15 

15 

1.50 

2 

2 

BM19 

25 

20 

31.05 

47 

16 

BM20 

27 

20 

33.96 

47 

14 

BM22 

20 

28 

19.31 

33 

224 

BM24 

20 

28 

25.78 

38 

242 

LSB 

28 

35 

521.05 

550 

42 

LSC 

12 

44 

56.61 

73 

934 

PE4 

10 

20 

-6155.33 

-6120 

76 

PE5 

10 

28 

-12462.10 

-12400 

98 

PE6 

5 

39 

-10672.34 

-10618 

84 

PE7 

5 

50 

-16612.82 

-16537 

476 

CJP33 

15 

33 

2520.57 

3089 

7086 

CJP40 

23 

40 

61796.54 

62027 

104 

CJP201 

133 

201 

6875.00 

7615 

1730 

CJP282 

241 

282 

176867.50 

258411 

10252 

Table  1 


Table  2  shows  the  difference  between  running  our  algorithm  with  and  without  cut  strength¬ 
ening. 


Normalization  4 

Strengthening 

No  Strengthening 

Problem 

Name 

Cuts  Iterations  %  Gap 

closed 

Cuts  Iterations  %  Gap 

closed 

LSB 

LSC 

150  14  100 

51  7  100 

261  30  91 

317  30  84 

Table  2 
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Table  3  compares  two  versions  of  our  algorithm  with  a  version  using  Gomory  cuts.  Table  4 
shows  our  results  on  the  PE  and  CJP  problems. 


Normalization  3 

Normalization  4 

Gomory  cuts 

Problem 

Cuts 

Iterations 

%  Gap 

Cuts 

Iterations 

%  Gap 

Cuts 

Iterations 

%  G^ 

Name 

dosed 

dosed 

dosed 

BM13 

370 

30 

100 

28 

4 

100 

110 

30 

42 

BM14 

2 

1 

100 

2 

1 

100 

2 

1 

100 

BM19 

121 

13 

100 

42 

4 

100 

112 

30 

79 

BM20 

132 

15 

100 

23 

4 

100 

115 

30 

79 

BM22 

405 

30 

51 

446 

30 

55 

93 

30 

32 

BM24 

514 

30 

53 

536 

30 

53 

103 

30 

32 

LSB 

461 

30 

97 

150 

14 

100 

65 

30 

65 

LSC 

370 

30 

100 

51 

7 

100 

35 

7 

100 

Table  3 


Normalization  4 

Gomory  cuts 

Problem 

Cuts 

Iterations 

%  Gap 

Cuts 

Iterations 

%Gap 

Name 

dosed 

closed 

PE4 

128 

24 

100 

46 

30 

94 

PE5 

148 

30 

92 

40 

30 

67 

PE6 

276 

30 

79 

40 

30 

35 

PE7 

302 

30 

86 

46 

30 

24 

CJP33 

558 

30 

77 

168 

30 

72 

CJP40 

19 

5 

100 

12 

4 

100 

CJP201 

1024 

15 

73 

365 

30 

66 

CJP282 

521 

15 

96 

350 

30 

22 

Table  4 


As  it  can  be  seen  from  Tables  3  and  4,  the  procedure  with  Normalization  4  found  optimal 
solutions  to  8  of  the  16  problems.  Normalization  4  seems  consistenly  better  than  3.  As 
mentioned  earlier,  we  ran  the  procedure  for  at  most  30  iterations  in  each  case.  Although 
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the  tables  do  not  reflect  this,  most  of  the  gap  reduction  tends  to  happen  during  the  first 
few  iterations.  In  a  few  instances,  like  CJP201  and  CJP282,  after  15  iterations  progress  had 
slowed  down  to  the  extent  that  prompted  the  termination  of  the  run. 

Figure  1  contains  detmled  information  for  the  solution  of  the  problem  BM20,  iteration 
per  iteration. 

The  next  class  of  problems  consists  of  randomly  generated  vertex  packing  (maximum 
stable  set)  problems.  We  present  the  solution  of  one  problem  on  a  graph  with  30  vercices 
and  30%  edge  density  and  three  problems  on  graphs  with  100  vertices  and  densities  5, 10  and 
15%.  Since  all  the  cuts  ax  >  for  this  problem  have  a  <  0  and  ^  <  0,  Normalization  2  was 
used  (with  =  -1).  Table  5  contains  the  problem  data;  the  last  column  gives  the  number 
of  search  tree  nodes  generated  by  LINDO's  branch  code.  Table  6  contains  the  results  of  the 
runs.  For  the  30-vertex  problem  we  started  with  the  edge  formulation  FS{G)  as  described 
in  Section  2.5.  The  initial  LP  solution  was  all  fractional  (xj  =  1/2,  for  j  =  1,...,30).  The 
problem  was  solved  in  10  iterations  after  adding  242  cuts.  It  is  interesting  to  note  that  the 
cutting  planes  generated  in  iteration  1,  2  and  3  were  almost  exclusively  clique  inequalities. 
The  striking  point  is  that  the  cuts  do  not  deteriorate  as  quickly  as  with  other  general  cutting 
plane  algorithms.  For  example,  in  iteration  4,  after  90  cuts  had  already  been  added  to  the 
problem,  the  cuts  that  were  generated  included  clique  inequalities,  odd  hole  inequalities  and 
lifted  odd-hole  inequalities.  Even  after  iteration  4,  the  cutting  planes  generated  had  small 
integer  left-hand-side  coefficients  (after  appropiate  scaling  of  the  right-hand-side  coefficient). 
For  the  100- vertex  problems,  we  started  from  a  formulation  in  which  the  rows  of  the  constraint 
matrix  represent  a  clique  cover  of  all  the  edges.  Each  clique  in  the  cover  is  obtained  in  a  greedy 
fashion  by  starting  with  a  vertex  of  largest  degree  and  expanding  the  clique  with  a  neighbor 
of  largest  degree  until  the  clique  is  maximal.  The  cutting  plane  algorithm  was  able  to  solve 
problem  VP5  in  6  iterations  (this  problem  took  4000  nodes  when  solved  by  LINDO’s  branch 
and  bound  code),  but  it  ran  into  difliculties  with  problem  VP15  after  closing  2/3  of  the 
integrality  gap. 
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Nonnalization  3 
•Nonnalization  4 
Gomoiy  cuts 


Problem 

name 

Number  of 

constraints 

Number  of 

variables 

Value  of 

LP  optimum 

Value  of 

IP  optimum 

Branch  ic  bound 

tree  nodes 

VP30 

168 

30 

15.00 

7 

62 

VPS 

211 

100 

45.25 

42 

4000 

VPIO 

356 

100 

38.00 

31 

1024 

VP15 

440 

100 

33.86 

25 

2670 

Table  5 


Normalization  2 

Gomory  Cuts 

Problem 

Cuts 

Iterations 

%  Gap 

Cuts 

Iterations 

%  Gap 

Name 

closed 

closed 

VP30 

242 

10 

100 

466 

30 

62 

VPS 

398 

6 

100 

1015 

30 

66 

VPIO 

1129 

13 

91 

1239 

25 

17 

VP15 

619 

7 

67 

773 

30 

6 

Table  6 


The  next  class  of  problems  we  considered  are  fixed-charge  network  flow  problems  of  the 
form: 

Min 

subject  to 

HjVij  -HjVji  =  for  all  t 

Vij  <  VijXij  for  all  arcs  (1,7) 

y  >  0,  Xij  €  {0,1}  for  all  arcs  (i,j) 

We  randomly  generated  the  problems  as  follows.  The  first  three  problems  are  fixed-charge 
capacitated  transportation  problems,  i.e.  fixed-charge  network  flow  problems  on  a  bipartite 
graph.  The  next  thi  are  fixed-charge  problems  on  general  networks.  The  arcs  in  the 

network  are  randomly  generated  to  match  a  specified  density  of  the  graph.  For  the  first  class 

we  generated  three  problems,  CTRl,  CTR2,  and  CTR3,  with  each  of  the  two  node  sets  in 
the  bipartite  graph  of  sizes  10,  15,  and  20,  and  densities  80%,  50%  and  35%,  respectively. 
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For  the  second  class  we  generated  three  problems,  FXCl,  FXC2,  and  FXC3,  on  networks 
with  10,  15,  and  20  nodes,  and  the  densities  of  80%,  50%  and  35  %,  respectively.  For  both 
classes,  the  fixed  cost  Cij  of  opening  an  arc  (t,  j)  was  randomly  generated  as  an  integer  in  the 
range  [0,20],  the  variable  cost  bij  of  using  an  arc  as  a  real  number  in  the  range  [0,2],  and  the 
capacity  Ujj  of  an  arc  as  an  integer  in  the  range  [1,20].  The  demands  and  supplies  bi  where 
randomly  generated  as  integers  in  the  range  [—20, 20]  and  so  that  they  satisfy  XT  =  0- 

Table  7  contains  the  description  of  these  problems,  including  the  number  of  constraints,  the 
number  of  0-1  and  continuous  variables  and  the  number  of  nodes  that  it  took  LINDO’s  branch 
and  bound  to  solve  them.  Table  8  contains  the  results  with  our  cutting  plane  procedure. 


Problem 

name 

Constraints 

Integer 

variables 

Continuous 

variables 

Value  of 

LP  optimum 

Value  of 

IP  optimum 

B  &  B 

tree  nodes 

CTNl 

103 

83 

83 

128.58 

183.34 

180800 

CTN2 

150 

120 

120 

169.79 

239.21 

1060254* 

CTN3 

182 

142 

142 

313.80 

432.28 

465382* 

FXCl 

92 

82 

82 

46.10 

62.62 

740 

FXC2 

123 

108 

108 

116.19 

148.91 

1352 

FXC3 

161 

141 

141 

152.01 

197.98 

773096 

Table  7 


^Number  of  nodes  at  which  ran  was  stopped  without  finding  an  optimal  solution. 
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Normalization  4 

Problem 

Cuts 

Iterations 

%  Gap 

Name 

closed 

CTNl 

434 

15 

94 

CTN2 

511 

15 

99 

CTN3 

582 

15 

95 

FXCl 

215 

15 

98 

FXC2 

388 

15 

95 

FXC3 

451 

15 

90 

Table  8 


We  note  that  although  the  problems  in  this  set  are  very  difficult  for  a  general  branch  and 
bound  algorithm,  our  procedure  manages  to  close  most  of  the  integrality  gap,  as  illustrated 
by  the  data  in  Table  8.  LINOO’s  branch  and  bound  code  was  not  able  to  solve  problems 
CTN2  and  CTN3  to  optimality  when  using  the  standard  linear  programming  relaxation  for 
bounding.  In  order  to  solve  both  problems  we  applied  branch  and  bound  to  the  strengthened 
linear  program  resulting  after  8  iterations  of  our  algorithm.  In  Table  9  we  show  for  problem 
CTN2,  the  benefits  of  strengthening  the  formulation  before  applying  branch  and  bound.  The 
results  are  given  for  a  number  of  iterations  of  our  strengthening  procedure  ranging  from  0 
to  11.  We  report  the  total  number  of  cuts  generated  by  our  algorithm,  the  CPU  time  (in 
minutes)  taken  to  generate  these  cuts,  the  number  of  nodes  in  the  branch  and  bound  tree 
needed  by  UNDO  to  solve  the  problem,  and  the  total  computing  time  taken  (in  minutes) 
including  cut  generation. 
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Iteration 

number 

Cuts 

CPU 

time 

B&B 

tree  nodes 

Total  CPU 

time 

0 

D 

0.00 

>1000000 

>2000 

1 

27 

0.38 

>600000 

>2000 

2 

47 

0.63 

360012 

1640.17 

3 

76 

1.78 

42120 

335.28 

4 

no 

3.87 

32758 

391.78 

5 

150 

10.32 

11498 

224.25 

6 

185 

15.57 

1486 

86.57 

7 

222 

21.52 

3522 

215.52 

8 

274 

50.09 

2548 

288.32 

9 

322 

76.64 

694 

167.10 

10 

348 

81.93 

622 

214.78 

11 

382 

94.58 

>500 

>2000 

Table  9 


The  last  class  of  test  problems  consists  of  two  difficult  real  world  asymmetric  traveling 
salesman  problems.  The  two  problems  that  we  ran  have  17  and  43  nodes  (cities),  respectively. 
They  are  samples  of  scheduling  problems  that  arise  regularly  at  chemical  plants  of  the  Dupont 
Company.  These  problems  proved  to  be  hard  to  solve  by  other  existing  methods.  In  particular, 
the  branch  and  bound  code  of  Carpaneto  and  Toth  [CP80],  one  of  the  most  efficients  codes 
for  this  problem,  took  more  than  110,000  nodes  in  the  branch  and  bound  tree  to  solve  the 
problem  on  17  cities.  For  the  problem  on  43  cities,  it  could  not  find  a  tour  after  running 
for  25.2  CPU  hours,  exceeding  the  memory  limitations  and  enumerating  more  than  580,000 
nodes  in  the  branch  and  bound  tree. 

We  ran  our  algorithm  on  these  problems  with  a  change  in  Step  1.  Instead  of  the  standard 
linear  programming  relaxation,  the  algorithm  starts  by  solving  the  assignment  problem.  If  the 
LP  solution  is  a  tour,  the  algorithm  stops.  If  the  LP  solution  is  integer  (i.e.  an  assignment) 
but  not  a  tour,  the  algorithm  identifies  the  subtours  and  adds  to  the  linear  program  the 
corresponding  subtour  elimination  inequalities  and  repeats  Step  1.  If  the  solution  to  the 
current  linear  program  is  fractional,  Step  2  of  the  algorithm  is  applied,  i.e.  a  family  of  cutting 
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planes  is  generated. 

For  the  17  city  problem  our  algorithm  needed  10  iterations  to  solve  the  problem  to  opti¬ 
mality,  four  of  them  corresponding  to  the  generation  of  subtour  elimination  constraints.  A 
total  of  151  cuts  were  added  during  the  procedure. 

In  the  case  of  the  43  city  problem,  in  view  of  the  large  number  of  edges,  we  had  to  use  cost 
matrix  sparsihcation  techniques.  To  get  an  initial  sparse  cost  matrix  we  generated  several 
“good”  tours  heuristically  and  took  the  union  of  their  arc  sets  as  the  only  arcs  of  our  graph. 
We  then  applied  to  this  sparse  problem  bur  cutting  plane  procedure  in  its  modified  form 
described  above.  When  an  optimal  tour  was  obtained  for  the  sparse  problem,  the  reduced 
costs  of  the  missing  arcs  were  checked,  and  a  subset  ol  the  arcs  with  negative  reduced  costs 
was  added  to  the  problem.  The  procedure  was  then  repeated. 

The  best  earlier  solution  to  this  problem,  of  value  5621,  was  found  by  Repetto  [1191]  using 
a  collection  of  heuristics.  Our  procedure  was  able  to  find,  after  the  second  set  of  edges  was 
chosen,  and  20%  of  the  arcs  were  present,  a  solution  of  value  5620.  After  several  additional 
iterations  there  were  still  missing  arcs  with  negative  reduced  costs,  and  the  procedure  was 
stopped  as  computationally  too  expensive. 

A  second  approach  was  then  tried.  The  linear  programming  relaxation  was  solved  with 
the  fully  dense  cost  matrix,  with  only  a  subset  of  the  subtour  elimination  constraints.  The 
lower  bound  LB  obtained  this  way  was  then  used  in  conjunction  with  the  upper  bound 
UB  =  5620  to  fix  at  0  all  variables  whose  reduced  cost  exceeded  UB  -  LB.  Cutting  planes 
were  then  generated,  which  raised  the  value  of  LB  to  5616.  At  this  point,  with  an  integrality 
gap  of  5620  -  5616  =  4,  the  problem  containing  only  those  constraints  tight  at  the  last  linear 
programming  optimum,  was  fed  to  LINDO’s  branch  and  bound  code.  This  linear  program  had 
a  total  of  172  constraints,  86  of  them  corresponding  to  assignment  constraints,  57  to  subtour 
elimination  constraints  and  29  to  cuts  generated  with  the  cutting  plane  algorithm.  Alter 
enumerating  374  nodes  in  the  branch  and  bound  tree  the  code  was  able  to  prove  optimality 
of  the  solution  with  value  5620  and  the  problem  was  solved. 

These  computational  experiments,  although  preliminary,  suggest  several  conclusions.  First, 
like  with  other  cutting  plane  algorithms,  the  effect  of  the  cutting  planes  on  the  objective 
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function  value,  and  hence  on  the  integrality  gap,  is  more  significant  at  the  beginning  and  less 
significant  as  more  cuts  are  generated.  This  is  partly  due  to  the  fact  that  as  the  number 
of  cuts  increases,  their  direction  tends  to  get  closer  to  that  of  the  objective  function.  This 
points  to  the  need  of  embedding  the  procedure  into  an  enumerative  framework,  so  that  when¬ 
ever  the  cuts  become  “shallow”,  branching  can  be  performed  to  move  away  from  the  current 
LP  optimum.  Nevertheless,  it  is  a  remarkable  feature  of  our  procedure,  and  the  family  of 
cutting  planes  it  generates,  that  no  numerical  problems  were  encountered  (with  these  cuts) 
throughout  the  experiment. 
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